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Consider an n qubit computational basis state corresponding to a bit string x, which has had an 
unknown local unitary applied to each qubit, and whose qubits have been reordered by an unknown 
permutation. We show that, given such a state with Hamming weight \x\ < L n /2J, it is possible to 
reconstruct \x\ with success probability 1 — |x'|/(n — \x\ + 1), and thus to compute any symmetric 
function of x. We give explicit algorithms for computing whether or not \x\ > t for some t, and for 
computing the parity of x, and show that these are essentially optimal. These results can be seen 
as generalisations of the swap test for comparing quantum states. 



I. INTRODUCTION 

Consider the following scenario. Alice is a physicist 
who has just completed a long quantum computing ex- 
periment. Her quantum computer has produced an n 
qubit state \x), corresponding to the bit string x. How- 
ever, before she can measure the state to determine x, she 
is called away from the lab. In her absence, Eve sneaks 
in and sabotages the experiment. First, she applies an 
arbitrary local rotation to the qubits (the same rotation 
on each qubit); she then rearranges all the qubits in an 
arbitrary order. 

It is clearly now hopeless for Alice to determine x ex- 
actly. Indeed, she cannot even determine an individual 
bit of x with any probability better than guessing. But 
what if she only needs to calculate f(x), for some func- 
tion /? Because of the arbitrary rearrangement of the 
qubits, she only has a chance of being able to compute 
symmetric functions, i.e. functions / where f{x) depends 
only on the Hamming weight of x. Also, because of 
the arbitrary local rotation, she can only compute func- 
tions / where f(x) = f(x), with x denoting bitwise nega- 
tion. This is equivalent to imposing the constraint that 

M < L»/2J. 

The purpose of this note is to show that Alice can in 
fact compute any f that satisfies these constraints (with 
some probability, which may be low in the worst case). 
Indeed, we have the following result. 

Theorem 1. Let x be an n-bit string with \x\ < \n/2\. 
Let U be an unknown and arbitrary single qubit unitary 
operator, and a be an unknown and arbitrary permuta- 
tion of n qubits. Then there is a procedure which, given 
a(U® n \x}), outputs \x\ correctly with probability 
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Assuming that \x\ is distributed uniformly at random, for 
large n this corresponds to an average probability of suc- 
cess of approximately 2(1 — In 2) rts 0.614. 
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Previous work has studied the closely related question 
of communicating without a shared reference frame 0]. 
In this setting, Alice wishes to communicate some (clas- 
sical or quantum) information to Bob by sending him n 
qubits, but Bob does not know Alice's basis for each of 
the qubits. The results of Q show that, by encoding 
across multiple qubits, Alice can send Bob a number of 
bits that approaches n, in the large n limit. By contrast, 
in the present work we do not allow prior encoding of 
Alice's information. Also note related previous work on 
the problem of computation in a hidden basis (l2| . 

Theorem [1] can be used to obtain procedures for com- 
puting any symmetric function of x. These results can 
be seen as generalisations of the problem of determin- 
ing whether n qubits are all in the same state [HI, [lj| , 
which is in turn a generalisation of the question of de- 
termining equality of two qubits, which can be solved 
using the well-known swap test [3, 0). For example, 
we have the following results for the threshold function 
Tn t (Tn f {x) = 1 <^> \x\ > t) and the parity function 
(Parity(x) =®iXi). 

Corollary 2. Let x be an n-bit string with \x\ < \n/2\. 
Let U be an unknown and arbitrary single qubit unitary 
operator, and a be an unknown and arbitrary permuta- 
tion of n qubits. Then there is a procedure which, given 
o~(U® n \x)), can compute Th ( (x) with success probability 
at least 1 — t/(n + 1), and can compute Parity(x) with 
probability at least 1/2 + l/(2(n + 1)). 

These success probabilities are essentially optimal, as 
we will show with the following theorem. 

Theorem 3. Let x be an n-bit string with \x\ < \n/2\. 
Let U be an unknown and arbitrary single qubit unitary 
operator, and a be an unknown and arbitrary permutation 
of n qubits. Let f(x) be some function such that f(k + 
1) f{k) for some < k < [n/2\ . Then any procedure 
that computes f(x) given access to a(U® n \x)) succeeds 
with probability at most 1 — {k + l)/(2(n — k)) in the 
worst case. 

This implies that, for example, one can determine 
whether or not \x\ = very effectively, but distinguish- 
ing \x\ = n/2 from \x\ = n/2 — 1 is hard. Interestingly, 
this phenomenon also occurs in the study of quantum 
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and classical query complexity [6| , and more generally in 
approximation theory [la . Section 3.4]. 



II. THE SYMMETRIC GROUP AND WEAK 
SCHUR SAMPLING 

The results in this note will be proven using some basic 
representation theory of the symmetric group S n , which 
we now outline. Conjugacy classes of S n are labelled by 
partitions Ahn. Each partition A containing k parts 
can be written as a non-increasing sequence of positive 
integers (Ai, . . . , A&), and expressed as a Young diagram. 
The diagram corresponding to the partition A is a col- 
lection of boxes arranged in left-justified rows, where the 
number of boxes in row i is given by A^. Irreducible rep- 
resentations (irreps) of the symmetric group are thus in 
one-to-one correspondence with Young diagrams. We let 
A denote both a partition and its corresponding diagram, 
and V\ denote the corresponding irrep. 



FIG. 1: The irreducible representations (3), (2,1), (1,1,1) of 
the group S3. 

We think of the input to our problem as an n-qubit 
state \x) in a known basis, to which an unknown, and 
arbitrary, tensor product unitary U® n has been applied, 
followed by an unknown permutation of the qubits a. 
In order to take advantage of these symmetries, we will 
use Schur-Weyl duality. This states that the space of n 
qudits decomposes into a direct sum of tensor products of 
subspaces corresponding to irreps of the symmetric and 
unitary groups, as follows: 



(1) 



Ahn 



where V\ and Qf correspond to irreps of S n and Ud, re- 
spectively. For good introductions to Schur-Weyl duality 
in the context of quantum information theory, see the 
theses 11 1 and @. 

The Schur transform [l], 0, El performs an implemen- 
tation of this decomposition, mapping a state in the com- 
putational basis to one of the form |A)|p)|g). In this case, 
as we are indifferent to permutations of the subsystems 
and local unitaries on each subsystem, we will only mea- 
sure the |A) register. This is known as weak Schur sam- 
pling Q. Theprojector onto a given value of A is given 
by (see or [B, Theorem 8]) 
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where d\ is the dimension of the irrep V\ , \\ i s the char- 
acter tr V\ , and D is the defining representation of S n 



that acts by permuting the n subsystems, 

D(w)\ii) ■ ■ ■ K) = • ■ ■ Iv-Hn))- 

It is unnecessary to perform the full Schur transform to 
measure A; it suffices to use the quantum Fourier trans- 
form over the symmetric group SL, in a procedure known 
as generalised phase estimation 0,03, which can be seen 
as a generalisation of the swap test @,0| ■ To perform gen- 
eralised phase estimation on an n qubit state p, one first 
prepares an ancilla register in the state ^ X^es I 71 ") - 
This register is used to control a conditional permutation 
of the subsystems of p, which is followed by an inverse 
quantum Fourier transform (over S„) on the ancilla reg- 
ister. Measuring the ancilla gives a value of A with prob- 
ability tr(P\p). This whole procedure can be performed 
efficiently, i.e. in time polynomial in n [f|. 

Let I a;) be the input state and assume that x has Ham- 
ming weight k < [ n /2j. Letting a be an arbitrary per- 
mutation of n qubits and U be an arbitrary local unitary, 
we now compute tr(P A D(a)U m \x)(x\(U^ n D{ay). As 
P\ commutes with local unitaries and permutations [|[, 
this is equal to tr(P\\x)(x\). Invariance under permu- 
tation also implies that the probability of obtaining a 
given outcome A depends only on the Hamming weight 
of x. The following crucial lemma allows us to write down 
exactly what these probabilities are. 

Lemma 4. Let Pr[£|/c] denote the probability of getting 
the measurement outcome corresponding to the partition 
(n — £, £) when performing weak Schur sampling on an 
n-bit string with Hamming weight k. Then, if £ > k, 
Pr[£|fc] = 0. Otherwise, 
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In particular, ~^2f =0 

Before we prove this lemma, we show that it implies 
the results stated in Section [TJ In the case of Theorem 
HJ we give an explicit algorithm that achieves the success 
probability required by the theorem: 

1. Perform weak Schur sampling, obtaining outcome 
A=(n -£,£). 

2. Output the guess that k~£. 

It is clear that this procedure, which we will term the 
standard algorithm, will output the correct answer with 
probability Pr[fe|fc] = 1 - k/(n -k + 1). 

One might consider more complicated strategics for in- 
ferring k from weak Schur sampling. A general inference 
strategy can be expressed as a matrix O, where Okt = 
Pr[output A:|get outcome {n — £,£)], and ^ fc Oki = 1 for 
all £. If one wishes to maximise the worst-case probabil- 
ity of outputting the correct value of k, for example, it is 
required to find an O that maximises 
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This is a linear programming problem and can be solved 
exactly for small n, although we do not know a closed 
form for the solution for general n. 

An alternative setting for the inference problem is the 
Bayesian scenario where one maximises the probability of 
success assuming an a priori probability distribution on 
k (see [Til l for a comprehensive introduction to Bayesian 
inference). Letting {pk} denote this probability distribu- 
tion, the problem is to maximise 




k < t, note that this algorithm succeeds with certainty, 
as Pr[/i|/c] = for I > k. We thus have a probabilis- 
tic algorithm that computes the threshold function Th ( 
with one-sided error. This can be modified to give an 
algorithm with small worst-case probability of error, as 
follows. 

Consider any procedure that attempts to compute an 
arbitrary boolean function f(k) from f(k), where k is 
picked to minimise the probability that f(k) = f{k). 
Such a procedure can be parametrised by two proba- 
bilities (jo, <Zi, where qo is the probability that the pro- 
cedure outputs 0, given that f(k) = 0, and q\ is the 
probability of outputting 0, given that f(k) = 1. Let 
Pi = Pr[/(fc) = 0|/(fe) = i] for i £ {0, 1}, and assume 
that po > pi . Then the probability of success of such a 
procedure (in the worst case) is at least 



This is clearly maximised by taking Om = 1 for k 



maxfc/p fc »/(£), and M = otherwise. In the partic- mm{q p + qi(l - Po), (1 - q )pi + (1 - qi)0- - Pi)}- 



ularly natural case where we assume that the a priori 
distribution on k is uniform, this maximisation in fact 
shows that the standard algorithm is optimal, and gives 
an average probability of success of 
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n- k + 1). 



For large n, this can be estimated as 
2 [ n l 2 

1 - - / k/(n-k + l)dk 
n Jo 



= 2ll-^ln 



n + 1 
n \n/2 + l 
« 2(1 - In 2) « 0.614. 

What about the scenario of Corollary [21 where we only 
want to compute some function f(k), rather than to out- 
put fe? It is natural to try to produce an algorithm with 
high success probability for all < k < [n/2\. Again, 
if one attempts to maximise this worst-case probability 
over all strategies that consist of performing weak Schur 
sampling and attempting to infer f(k) from the result, 
one is led to a linear programming problem for which we 
do not know a closed form solution. A more straightfor- 
ward approach is to guess k using the standard algorithm 
(call this guess k), and then to output f(k). 

The probability that this gives the right answer can 
easily be calculated for threshold functions TH t . Assum- 
ing k > t, 



Pr[/(A:) + f(k)] 



E 

1=0 
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V/-1/ 



We pick qi such that these two values are equal, which 
gives 



'7i 



1 - 9o(po +Pi) 
2 - Po - Pi 



Our goal is to maximise the corresponding expression for 
the probability of success, 



qoPo + qi (1 ~ Po) 



1 + 9o(Po - Pi) - Po 



Po-Pi 



over go, while still obeying the constraints < go, qi < L 
This is straightforward and gives the answer 



1 r 1 

, 1), qi = max{0, 



Po +Pi 



■Pa- Pi 
2-po-pi 



}■ 



which corresponds to a maximum worst-case probability 
of success of po/(po + Pi) when po + Pi > 1, and (1 — 
Pi)/(2 -p - pi) whenpo +Pi < 1- 

Applying this result to the threshold function THt, 
where po — 1 and p\ = t/(n — t + 1), it can easily be 
seen that we obtain a two-sided error algorithm that al- 
ways succeeds with probability at least 1 — t/(n +1), as 
stated in Corollary [2] 

In the case of the Parity function, we can calculate 
the probability that f(k) ^ f(k) from 



Pr[fc is even] — Pr[fc is odd] 



vkl e=o 



n 

£-1 



It is then immediate that 



This is clearly maximised by k = t, giving a failure prob- 
ability of at most t/(n — t + 1). On the other hand, if 
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and of course Pr[fc is odd] = 1 — Pr[/c is even]. Assume 
that n is even and n/2 is also even (the cases where n 
or n/2 is odd are analogous). This implies that, for k 
even, we have Pr[/(fc) = f(k)] > 1/2, and for k odd, 
Pr[/(A?) = /(&)] > 1/2 + 1/n. We can use the same 
technique as before to get an algorithm that succeeds 
with probability at least 1/2 + l/(2(n + 1)) in the worst 
case. 



III. LIMITS ON SUCCESS PROBABILITY 

To prove Theorem [3J and hence to show that these 
algorithms are almost optimal, consider the restricted 
problem of distinguishing a bit-string x with weight k 
from one with weight k + 1, as is required to compute a 
symmetric function / where f(k) ^ f{k + 1). 

We first argue that any procedure for computing / 
might as well simply consist of weak Schur sampling and 
post-processing the results. Imagine that an adversary, 
as in the scenario of Section [IJ has performed a random 
permutation and a random local rotation of each qubit 
on the initial state \x). Then, from Alice's perspective, 
the resulting state looks like 




dUU® n D(a)\x)(x\D(a)HU^ n , 



which is equal to X^Ahn ^aP\ for some coefficients {k\}. 
(This follows from the decomposition of cqn. ([T]) and 
Schur's Lemma, using the fact that p commutes with 
all permutations and local unitaries.) This implies that, 
without loss of generality, a measurement strategy can be 
taken as consisting of measuring A and performing some 
classical post-processing. 

Let pk {£) denote the probability distribution over par- 
titions (n—£,£) obtained by performing weak Schur sam- 
pling on an input with weight k. We calculate the l\ 
distance between the distributions pk, Pk+i for arbitrary 
< k < [n/2\: 




Using standard results on distinguishing probability dis- 
tributions, this distance puts an upper bound on the 
probability of success of any algorithm attempting to dis- 
tinguish between weights k and fc+1, and implies that the 
above algorithms are asymptotically optimal. We finally 
turn to the proof of Lemma [¥] 



IV. PROOF OF LEMMA H 

Let A be the partition (n — £, £) and let a; be a bit-string 
with Hamming weight k < [n/2\, assuming without loss 
of generality that \x) = 1 • • • 10 • ■ • 0), where the first 
k bits of x arc 1 and the last n — k are 0. We now 
calculate Pr[f|fc] = tr(P^|a;)(a;|) using ([2|). It is easy to 
see that tr(D(ir)\x)(x\) = unless it leaves the bit-string 
x unchanged, in which case tr(D(7r)|x)(x|) = 1. All such 
permutations 7r can be decomposed as a direct product 
of a permutation of the first k bits, and a permutation of 
the last n — k bits. This implies that 

tr(P A |x><*|) = ^tr( Y, ^W). 

\7reS fc xS„_ fc / 

We first calculate the sum over the group x SVi-fc ■ Note 
that the representation V\, while irreducible over S n7 is 
not necessarily irreducible over Sk x S n —k, but may split 
into a direct sum of irreps. The following simple lemma, 
which can be proven using Schur's Lemma, shows that 
only the trivial irrep is of interest. 

Lemma 5. Let V\ be an irreducible representation of a 
finite group G. Then 




\G\ ifV\is the trivial irrep 
otherwise. 



Each occurence of the trivial irrep in the decomposi- 
tion of the representation V\ over Sk x SVi-fc will thus 
give a contribution of k\(n — k)\ to the sum; all other ir- 
reps will contribute nothing. This number of occurrences 
can be calculated using a special case of the Littlewood- 
Richardson rule known as Pieri's formula [Io| . 

Let n be the diagram corresponding to the trivial irrep 
of Sk, for some k. Then, for any A and v, we define the 
Littlcwood-Richardson number as the number of 

ways that A can be expanded to v by adding k boxes 
to A, under the constraint that at most one new box is 
added to each column. See Figure [2] for an illustration 
of this process, and note that N\^ u is always either or 
1 (though this does not remain true in the more general 
setting where /x can be arbitrary; then the rule is more 
complicated). 



FIG. 2: Expanding the diagram (2, 1) to (3, 2) using the dia- 
gram (2). 

Littlcwood-Richardson numbers are relevant because 
of the following theorem [loj ]. 

Theorem 6. Let fj, be the partition (n — k). The multi- 
plicity of the irrep V\ ® in the restriction of the irrep 
V v from S n to Sk x S n -k is equal to Nx^ u . 
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As we are only interested in expansions of the trivial 
irrep (fc) by another trivial irrep (n — k), this multiplicity 
is particularly simple to calculate. Let vhc a. partition of 
n, A be the partition (k), and /x be the partition (n — k). 
Then, if v has more than two parts, Nx^ = 0. (This was 
expected anyway because each of the n subsystems we are 
dealing with has dimension 2.) If v has two parts, express 
it as (n -£,£). Then, if k < £, N\^ v = 0. Otherwise, 
N Xliv = 1. 

This deals with the sum; to finish the calculation, we 
need to find the dimension d\. This can be evaluated 
using the famous hook-length formula fiol ]. Let x be a 
box in a Young diagram. Then the hook-length h(x) is 
defined as the total number of boxes in the same row 
and to the right of x, plus the total number in the same 
column and below x, plus 1 (for x itself). See Figure [3] 
for an illustration. 
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FIG. 3: Hook-lengths of the cells in the diagram (4,2). 

The hook-length formula states that 

n\ 

As we only need calculate d\ for partitions A = (n — £, £), 
this formula is particularly simple, and gives 

_ n\(n-2£+ 1) _ /n\ /n-2£+l\ 
A ~ £\{n-£+l)\ ~\£J \ n-e+1 ) ' 

To sum up, we have, for £ < k, 



Pr[m = ^tr £ V x (n)\ 

' \TT€S k XS n - k J 

0) ( n-2£+l \ (g - Q2J 

where the last step is a binomial coefficient identity that 
can be verified directly. For £ > k, by Pieri's formula the 
sum over Sk x SVi-fc is zero. This implies that Pr[^|fc] = 
in this case, and completes the proof. 

V. CONCLUSION 

We conclude that it is possible to compute symmetric 
functions of an n qubit state \x), even if a malicious ad- 
versary has applied an arbitrary local rotation and an ar- 
bitrary permutation to the state, even without any prior 
encoding of x. This is in (perhaps surprising) contrast to 
the fact that any individual bit of x cannot be retrieved. 
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